Frame-Wise Dynamic Threshold Based Polyphonic Acoustic Event Detection
نویسندگان
چکیده
Acoustic event detection, the determination of the acoustic event type and the localisation of the event, has been widely applied in many real-world applications. Many works adopt multi-label classification techniques to perform the polyphonic acoustic event detection with a global threshold to detect the active acoustic events. However, the global threshold has to be set manually and is highly dependent on the database being tested. To deal with this, we replaced the fixed threshold method with a frame-wise dynamic threshold approach in this paper. Two novel approaches, namely contour and regressor based dynamic threshold approaches are proposed in this work. Experimental results on the popular TUT Acoustic Scenes 2016 database of polyphonic events demonstrated the superior performance of the proposed approaches.
منابع مشابه
A Dynamical Bayesian Network for Tempo and Polyphonic Pitch Tracking
We present a model for simultaneous tempo and polyphonic pitch tracking. Both these acoustic analysis tasks are difficult and, arguably, no satisfactory solution currently exists for polyphonic pitch tracking. Our model, a form of Dynamical Bayesian Network, embodies a transparent and computationally tractable approach to this acoustic analysis problem. An advantage of our approach is that it p...
متن کاملBidirectional LSTM-HMM Hybrid System for Polyphonic Sound Event Detection
In this study, we propose a new method of polyphonic sound event detection based on a Bidirectional Long Short-Term Memory Hidden Markov Model hybrid system (BLSTM-HMM). We extend the hybrid model of neural network and HMM, which achieved stateof-the-art performance in the field of speech recognition, to the multi-label classification problem. This extension provides an explicit duration model ...
متن کاملGenerative Model Based Polyphonic Music Transcription
In this paper we present a model for simultaneous tempo and polyphonic pitch tracking. Our model, a form of Dynamical Bayesian Network [1], embodies a transparent and computationally tractable approach to this acoustic analysis problem. An advantage of our approach is that it places emphasis on modeling the sound generation procedure. It provides a clear framework in which both high level (cogn...
متن کاملCoupled Sparse Nmf vs. Random Forest Classification for Real Life Acoustic Event Detection
In this paper, we propose two methods for polyphonic Acoustic Event Detection (AED) in real life environments. The first method is based on Coupled Sparse Non-negative Matrix Factorization (CSNMF) of spectral representations and their corresponding class activity annotations. The second method is based on Multi-class Random Forest (MRF) classification of time-frequency patches. We compare the p...
متن کاملMulti-pitch estimation for polyphonic musical signals
Automatic Score Transcription goal is to achieve an score-like (notes pitches through time) representation from musical signals. Reliable pitch extraction methods for monophonic signals exist, but polyphonic signals are much more difficult, often ambiguous, to analyze. We propose a computationally efficient technique for automatic recognition of notes from a polyphonic signal. It looks for corr...
متن کامل